Comparison of data augmentation methods for legal document classification
نویسندگان
چکیده
Sorting out the legal documents by their subject matter is an essential and time-consuming task due to large amount of data. Many machine learning-based text categorization methods exist, which can resolve this problem. However, these algorithms not perform well if they do have enough training data for every category. Text augmentation Data a widely used technique in learning applications, especially computer vision. Textual has different characteristics than images, so solutions must be applied when need arises. type textual or itself may reduce number that could certain scenario. This paper focuses on classifying them into specific groups matters.
منابع مشابه
data mining rules and classification methods in insurance: the case of collision insurance
assigning premium to the insurance contract in iran mostly has based on some old rules have been authorized by government, in such a situation predicting premium by analyzing database and it’s characteristics will be definitely such a big mistake. therefore the most beneficial information one can gathered from these data is the amount of loss happens during one contract to predicting insurance ...
15 صفحه اولA Comparison of Methods for Web Document Classification
WebDoc is an automated classification system that assigns Web documents to appropriate Library of Congress subject headings based upon the text in the documents. We have used different classification methods in different versions of WebDoc. One classification method is a statistical approach that counts the number of occurrences of a given noun phrase in documents assigned to a particular subje...
متن کاملData Augmentation for Plant Classification
Data augmentation plays a crucial role in increasing the number of training images, which often aids to improve classification performances of deep learning techniques for computer vision problems. In this paper, we employ the deep learning framework and determine the effects of several data-augmentation (DA) techniques for plant classification problems. For this, we use two convolutional neura...
متن کاملLeveraging Document Structure for Better Classification of Complex Legal Documents
Document classification is a machine learning application that has been as impactful as it has been successful in a myriad of domains and applications. However, when the documents being classified are large and highly-complex, and when the set of potential classes is large as well, these models could be improved by incorporating more information about the documents’ overall structure. Most appr...
متن کاملAn Augmentation Hybrid System for Document Classification and Rating
This paper introduces an augmentation hybrid system, referred to as Rated MCRDR. It uses Multiple Classification Ripple Down Rules (MCRDR), a simple and effective knowledge acquisition technique, combined with a neural network. Introduction As we move from the Information Age to the Age of Information Overload, Information Filtering (IF) has gained significant attention in the research communit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Acta technica Jaurinensis
سال: 2021
ISSN: ['1789-6932', '2064-5228']
DOI: https://doi.org/10.14513/actatechjaur.00628